Stack Overflow Query Outcome Prediction

نویسندگان

  • Robbie Jones
  • David Lin
چکیده

Stack Overflow’s core mission is to create an online encyclopedia for all programming knowledge. In order to ensure quality content in the face of rapid growth, community moderators frequently close low quality questions, often asked by newcomers. In order to alleviate moderator burden and ease newcomers’ transition, we devise two classifiers to predict 1) whether a question will be closed and if close 2) its reason for closure. We train our models using logistic regression, SVMs, and boosting before selecting the optimal classifier. We found that the adaptive boosting algorithm best classified whether a question would be closed, whereas lasso-regulated logistic regression best classified the reason for closure. Our next steps to improve our classifiers include using word vectors, splicing the data by time period, and extracting more features from code segments.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

StaQC: A Systematically Mined Question-Code Dataset from Stack Overflow

Stack Overflow (SO) has been a great source of natural language questions and their code solutions (i.e., question-code pairs), which are critical for many tasks including code retrieval and annotation. In most existing research, question-code pairs were collected heuristically and tend to have low quality. In this paper, we investigate a new problem of systematically mining question-code pairs...

متن کامل

Improving Stack Overflow Tag Prediction Using Eye Tracking

I) Goals and Purpose Software developers use Stack Overflow to post questions and answers related to programming and computer science problems they need to solve. Questions such as seeking input on some efficient and time-saving methods of coding a particular program, getting help on solving various bottlenecks in coding are commonly seen. When users submit questions on Stack Overflow they need...

متن کامل

Embedded Emotion-based Classification of Stack Overflow Questions Towards the Question Quality Prediction

Software developers often ask questions in Stack Overflow Q & A site, and their posted questions sometimes do not meet the standard guidelines. As a consequence, some of the questions are edited by expert users, some of them are down-voted, or some are even deleted permanently. Besides, the users (i.e., developers) might not get the expected solutions for their problems. In this paper, we study...

متن کامل

Advantages Of Object Relational Database Model

This article explores the differences between relational databases (RDBMS) and You should have looked into the property-graph model and optionally read especially for join heavy queries, the minutes to milliseconds advantage that Even object-relational mappers use SQL under the hood to talk to the database. When you write applications that communicate with a relational database, your created a ...

متن کامل

GitHub and Stack Overflow: Analyzing Developer Interests Across Multiple Social Collaborative Platforms

Increasingly, software developers are using a wide array of social collaborative platforms for software development and learning. In this work, we examined the similarities in developer’s interests within and across GitHub and Stack Overflow. Our study finds that developers share common interests in GitHub and Stack Overflow; on average, 39% of the GitHub repositories and Stack Overflow questio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016